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Maximum likelihood principle is shown to be the best mea- 
sure for relating the experimental data with the predictions 
of quantum theory. 



Quantum theory describes events on the most funda- 
mental level currently available. The synthesis of infor- 
mation from mutually incompatible quantum measure- 
ments plays the key role in testing the structure of the 
theory. The purpose of this Letter is to show a unique 
relationship between quantum theory and the mathemat- 
ical statistics used to obtain optimal information from 
incompatible observations: Quantum theory prefers the 
relative entropy (maximum likelihood principle) as the 
proper measure for evaluation of the distance between 
measured data and probabilities defined by quantum the- 
ory. 

In the standard textbooks |l| , a quantum measurement 
is represented by a hermitian operator A, whose spec- 
trum determines the possible results of the measurement 
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In the following, the Dirac notation will be used and 
for the sake of simplicity a discrete spectrum will be as- 
sumed. Eigenstates are orthogonal {a\a') = 5aa' and the 
corresponding projectors provide the closure relation 
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Projectors predict the probability for detecting a partic- 
ular value of the q-variable a represented by the operator 
A as Pa = {a\p\a), provided that the system has been pre- 
pared in a quantum state p. This mathematical picture 
corresponds to the experimental reality in the following 
sense: When the measurement represented by the op- 
erator A is repeated N times on identical copies of the 
system, the number a particular output a is collected Na 
times. The relative frequencies fa = ^ will sample the 
true probability as fa Pa fluctuating around them. 
The exact values are reproduced only in the asymptoti- 
cal limit N oo. Experimentalist's knowledge may be 
expressed in the form of a diagonal density matrix 



provided that error bars of the order 1 /N are associated 
with the sampled relative frequencies. This should be 
understood as mere rewriting of the experimental data 
{N, Na}- Similar knowledge may be obtained by observa- 
tions, which can be parameterized by operators diagonal 
in the \a) basis, i.e. by operators commuting with opera- 
tor A. But the possible measurement of non-commuting 
operators yields new information, which cannot be de- 
rived from the measurement of A. 

From this viewpoint it seems to be advantageous 
to consider the sequential synthesis of various non- 
commuting observables. In this case several operators 
Aj, j = 1,2, .. . will be measured by probing of the system 
N times together. Now, one expects to gain more than 
just the knowledge of the diagonal elements of the den- 
sity matrix in some a priori given basis. This sequential 
measurement of non-commuting observables should be 
distinguished from the similar problem of approximate si- 
multaneous measurement of non-commuting observables 
Q. As in the case of the measurement of a hermitian 
operators, the result of sequential measurements of non- 
commuting operators may be represented by a series of 
projectors \yi){yi\. This should be accompanied by rela- 
tive frequencies fi indicating how many times a particular 
output i has been registered, J2ifi — 1- Various states 
need not be orthogonal {yi\yj) ^ Sij, in contrast to the 
previous case of a hermitian operator. However, this sub- 
stantial difference has its deep consequences. The result 
of the measurement cannot be meaningfully represented 
in the same manner as previously. For example, direct 
linking of probabilities with relative frequencies used in 
standard reconstructions |||,^ p.^ = fi, pu = {yi\p\yi), 
may appear as inconsistent, since the system of linear 
equation is overdetermined, in general. 

A novel approach will be suggested here. Let us 
assume the existence of a quantum measure F{pii\fi), 
which parameterizes the distance between measured data 
and probabilities predicted by quantum theory. We will 
search for the state(s) located in the closest neighbor- 
hood of the data. A general state may be parameterized 
in its diagonal basis as 
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The equation for the extremal states may be found anal- 
ogously to the treatment developed in for maximum 
likelihood estimation as 
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where A is a Lagrange multiplier. The normalization con- 
dition Trp = 1 sets its value to 

Any composed function G{F{pii\fi)) fulfills the same ex- 
tremal equation (|^) with the Lagrange multiplier rescaled 
as A^. Without loss of generality it is therefore enough 
to consider the normalization condition A = 1. 

The extremal equation (||) has the form of a decom- 
position of the identity operator on the subspace, where 
the density matrix is defined by 
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This resembles the definition of a probability- valued op- 
erator measure (POVM) characterizing a generalized 
measurement . To link the above extremalization with 
quantum theory, let us postulate the natural condition 
for the quantum expectation value 
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This assumption seems to be reasonable. The synthesis of 
sequential non-compatible observations may be regarded 
as a new measurement scheme, namely the measurement 
of the quantum state. 

The quantum measure F then fulfills the differential 
equation 



dF 
dpii 



Pa — fi- 



and singles out the solution in the form 



F{Pit\fi) = ^ft^'apv. 
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This is nothing else than the log likelihood or KuUback- 
Leibler relative information |Q. Formal requirements of 
quantum theory, namely the interpretation of the ex- 
tremal equation as a POVM, result in the concept of max- 
imum likelihood in mathematical statistics. The analogy 
between the standard quantum measurement associated 
with a single hermitian operator, and a series of sequen- 
tial measurements associated with many non-commuting 
operators is apparent now. The former determines the 
diagonal elements in the basis of orthonormal eigenvec- 
tors, whereas the latter estimates not only the diago- 
nal elements, but the diagonalizing basis itself. This is 
the difference between measurement of quantum observ- 
able A and measurement of the quantum state. In this 
sense maximum likelihood estimation may be considered 



as a new quantum measurement. The observed quantum 
state is given by the solution of the nonlinear operator 
equation 
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which is, in fact, the completeness relation of a POVM 
with measured outputs {fi}. Special cases of the solu- 
tion have been discussed recently for thephase 
the diagonal elements of the density matrix [p| and the 
reconstruction of the 1/2 spin state 10 1. 

Quantum interpretation offers a new viewpoint on the 
maximum likelihood estimation. This method is custom- 
arily considered as just one of many estimation meth- 
ods, unfortunately one of the most complicated ones. It 
is often considered as a subjective method, since like- 
lihood quantifies the degree of belief in a certain hy- 
pothesis. Any physicist, an experimentalist above all, 
would perhaps use as the first choice the least-squares 
method for fitting theory and data Q]. Let us evalu- 
ate this as an illustrative counterexample. In this case 
F{pii\fi) ~ '^i{Pii~ fiY E^nd the extremal equation reads 



2 5Z(^" ~ f^)\yi)(y-^\p = ^P^ 

i 

A = 2^(pii - fi)pii. 
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Equation (g_lj) may be again interpreted as a complete- 
ness relation for the POVM 

However the expectation value is a rather complicated 
implicit function of the measured data, since 
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It does not mean that the least -squares method is in- 
correct, it only means that such fitting does not reveal 
the structure of quantum measurement. In this sense the 
maximum likelihood seems to be unique and exceptional. 

There are several fundamental consequences of this re- 
sult. According to Fisher's theorem pi] , maximum like- 
lihood estimation is unbiased and achieves the Cramer- 
Rao bound asymptotically for large — > oo. As demon- 
strated here, for any finite N maximum likelihood may be 
interpreted as a quantum measurement. When seen this 
way, bias and the noise above the Cramer-Rao bound 
seem to be unpleasant but natural properties of quan- 
tum systems. Maximum likelihood may set new bounds 
on distinguishability related currently to the Fisher infor- 
mation [ p2[ . Fisher information corresponds to the Rie- 
mannian distinguishability metrics and may be naturally 
interpreted as the distance in the Hilbert space. Besides 
this, fundamental equations of quantum theory such as 
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Schrodinger, Klein-Gordon, Pauli etc. and other physi- 
cal laws may be derived from the principle of minimum 
Fisher information ||l3|,^ . Since Fisher information only 
approximates the behaviour of likelihood in asymptoti- 
cal limit, all these features seem to be involved also in 
the maximum likelihood principle. However, the latter 
one is obviously stronger since, as shown here, only the 
maximum likelihood is able to reproduce the structure of 
quantum measurement for finite observations. 

Notice also that maximum likelihood generalizes the 
notion of POVM in the following sense. Actual mea- 
surements may be (and usually always are) incomplete. 
However, the synthesis of any incomplete measurements, 
namely of the original observations represented here by 
\yi)(yi\, is complete in the subspace, where the resolution 
of the identity (j^) holds. POVM and estimated quan- 
tum state are mutually connected in dependence on the 
type of measurement and on its results. In particular, it 
is not necessary to consider only the special scheme for 
quantum state observation as for example the mutually 
complementary eigenbases |15 . 

We conclude with a remark that may shed light on 
why maximum likelihood is peculiar: Maximum likeli- 
hood is perhaps singled out by Nature, because the non- 
symmetric fluctuations of data of multinomial distribu- 
tions, which are the results of quantum measurements, 
are compensated, so to say, by an equally non-symmetric 
attribution of degrees of belief to the various test states. 
Maximum likelihood takes into account that, in finite 
observations, improbable events tend to appear more fre- 
quently than they should, and conversely, very probable 
events tend to appear somewhat less often. 
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